Back

G3: Genes, Genomes, Genetics

Oxford University Press (OUP)

All preprints, ranked by how well they match G3: Genes, Genomes, Genetics's content profile, based on 222 papers previously published here. The average preprint has a 0.03% match score for this journal, so anything above that is already an above-average fit. Older preprints may already have been published elsewhere.

1
A Cytological F1 RNAi Screen for Defects in Drosophila melanogaster Female Meiosis

Gilliland, W. D.; Bowen, A. O.; May, D. P.; Conger, K. O.; Elrad, D.; Marciniak, M.; Mashburn, S. A.; Presbitero, G.; Welk, L. F.

2024-01-15 genetics 10.1101/2024.01.12.575435 medRxiv
Top 0.1%
22.7%
Show abstract

Genetic screens for recessive alleles induce mutations, make the mutated chromosomes homozygous, and then assay those homozygotes for the phenotype of interest. When screening for genes required for female meiosis, the phenotype of interest has typically been nondisjunction from chromosome segregation errors. As this requires that mutant females be viable and fertile, any mutants that are lethal or sterile when homozygous cannot be recovered by this approach. To overcome these limitations, our lab has screened the VALIUM22 collection produced by the Harvard TRiP Project, which contains RNAi constructs targeting genes known to be expressed in the germline in a vector optimized for germline expression. By driving RNAi with GAL4 under control of a germline-specific promoter (nanos or mat-alpha4), we can test genes that would be lethal if knocked down in all cells, and by examining unfertilized metaphase-arrested mature oocytes, we can identify defects associated with genes whose knockdown results in sterility or causes other errors besides nondisjunction. We screened this collection to identify genes that disrupt either of two phenotypes when knocked down: the ability of meiotic chromosomes to congress to a single mass at the end of prometaphase, and the sequestration of Mps1-GFP to ooplasmic filaments in response to hypoxia. After screening >1450 lines of the collection, we obtained multiple hits for both phenotypes, identified novel meiotic phenotypes for genes that had been previously characterized in other processes, and identified the first phenotypes to be associated with several previously uncharacterized genes.

2
A framework for evaluating edited cell libraries created by massively parallel genome engineering

Cawley, S.; Abbate, E.; Abraham, C. G.; Alvarez, S.; Barber, M.; Bolte, S.; Bruand, J.; Church, D. M.; Davis, C.; Estes, M.; Federowicz, S.; Fox, R.; Gander, M. W.; Garst, A. D.; Gencer, G.; Hardenbol, P.; Hraha, T.; Jain, S.; Johnson, C.; Juneau, K.; Krishnamurthy, N.; Lambert, S.; Leland, B.; Pearson, F.; Ray, J. C. J.; Sanada, C. D.; Shaver, T. M.; Shepherd, T. R.; Spindler, E. C.; Struble, C. A.; Swat, M. H.; Tanner, S.; Tian, T.; Wishart, K.; Graige, M. S.

2021-09-23 genomics 10.1101/2021.09.23.458228 medRxiv
Top 0.1%
21.9%
Show abstract

Genome engineering methodologies are transforming biological research and discovery. Approaches based on CRISPR technology have been broadly adopted and there is growing interest in the generation of massively parallel edited cell libraries. Comparing the libraries generated by these varying approaches is challenging and researchers lack a common framework for defining and assessing the characteristics of these libraries. Here we describe a framework for evaluating massively parallel libraries of edited genomes based on established methods for sampling complex populations. We define specific attributes and metrics that are informative for describing a complex cell library and provide examples for estimating these values. We also connect this analysis to generic phenotyping approaches, using either pooled (typically via a selection assay) or isolate (often referred to as screening) phenotyping approaches. We approach this from the context of creating massively parallel, precisely edited libraries with one edit per cell, though the approach holds for other types of modifications, including libraries containing multiple edits per cell (combinatorial editing). This framework is a critical component for evaluating and comparing new technologies as well as understanding how a massively parallel edited cell library will perform in a given phenotyping approach.

3
Letter to the Editor regarding "Long-read genome sequencing provides novel insights into the harmful algal bloom species Prymnesium parvum" by Jian et al. (2024)

Wisecaver, J.; Jeje, T.; Watervoort, N. F.

2026-01-23 genomics 10.64898/2026.01.21.699772 medRxiv
Top 0.1%
19.4%
Show abstract

Jian et al. (2024) describe de novo genome assemblies for two strains of Prymnesium parvum (sensu lato, s.l.), a cryptic species complex of toxic, unicellular algae responsible for harmful algal blooms around the world. Here, we present evidence that the labels for UTEX 2797 and CCMP 3037 were inadvertently swapped by Jian et al. (2024). This resulted in sequence data labeled "UTEX 2797" but derived from strain CCMP 3037, and vice versa. Strain misidentification is a major risk with cryptic species like P. parvum s.l., and our reanalysis of the data in Jian et al. (2024) underscores the urgent need for clade-specific markers to ensure accurate and efficient strain identification.

4
A genome-wide MAGIC kit for recombinase-independent mosaic analysis in Drosophila

Shen, Y.; Yeung, A. T.; Ditchfield, P.; Korn, E.; Clements, R.; Chen, X.; Wang, B.; Sheen, M.; Jarman, P.; Han, C.

2025-07-02 genetics 10.1101/2025.06.30.662354 medRxiv
Top 0.1%
18.3%
Show abstract

Mosaic analysis has been instrumental in advancing developmental and cell biology. Most current mosaic techniques rely on exogenous site-specific recombination sequences that need to be introduced into the genome, limiting their application. Mosaic analysis by gRNA-induced crossing-over (MAGIC) was recently developed in Drosophila to eliminate this requirement by inducing somatic recombination through CRISPR/Cas9-generated DNA double-strand breaks. However, MAGIC has not been widely adopted because gRNA-markers, a required component for this technique, are not yet available for most chromosomes. Here, we present a complete, genome-wide gRNA-marker kit that incorporates optimized designs for enhanced clone induction and more effective clone labeling in both positive MAGIC (pMAGIC) and negative MAGIC (nMAGIC). With this kit, we demonstrate clonal analysis in a broad range of Drosophila tissues, including cell types that have been difficult to analyze using recombinase-based systems. Notably, MAGIC enables clonal analysis of pericentromeric genes and deficiency chromosomes and in interspecific hybrid animals, opening new avenues for gene function study, rapid gene discovery, and understanding cellular basis of speciation. This MAGIC kit complements existing systems and makes mosaic analysis accessible to address a wider range of biological questions. IMPACT STATEMENTA comprehensive toolkit enables genome-wide, recombinase-independent mosaic analysis in Drosophila, permitting clonal analysis of pericentromeric genes, deficiency chromosomes, and interspecific hybrids previously inaccessible to standard methods.

5
Assessing the adaptative potential to temperature and precipitation along a steep environmental gradient in populations of European beech

Tost, M.; Grigoriadou Zormpa, O.; Wilhelmi, S.; Mueller, M.; Wildhagen, H.; Curtu, A. L.; Gailing, O.

2025-06-06 genetics 10.1101/2025.06.03.657567 medRxiv
Top 0.1%
14.8%
Show abstract

Climate change poses a significant threat to European beech. These concerns highlight the need to assess the adaptive potential of European beech populations to climate change. Landscape genomics, also known as environmental association analysis, is a powerful tool for identifying gene loci that contribute to local adaptation to environmental pressures. Genotypic data was collected from [~]100 adult beech trees per stand in five locations in the South-Eastern Romanian Carpathians along an altitudinal gradient associated with precipitation and temperature. In total, 53 environmental variables, comprising frost frequency change, temperature and precipitation, were extracted from the climatology data base CHELSA. Based on these variables the Ellenberg-Quotient (EQ) was calculated. We performed environmental association analysis using LFMM (latent factor mixed models) to identify Single Nucleotide Polymorphism (SNP) markers associated with environmental variables and with the principal components calculated based on these. We identified 446 SNP markers significantly associated with the first principal component (PC). These were overlapping with the SNP markers significantly associated with all environmental variables except precipitation accumulated during the growing season. The first PC was correlated with all temperature-based variables and elevation at |r| [~]0.989 to [~]0.997 and with all precipitation-based and Ellenberg-Quotient variables at |r| [~]0.945 to 0.950, except precipitation accumulated during the growing season. A high peak region on chromosome 2 from [~]4.56 to [~]16.27 Mb appeared in all results. This region was [~]3.47 Mb downstream from a region for local adaptation identified by Lazic et al. (2024). In this peak, 273 markers located in the coding region of 22 genes were found. Ten out these 22 were described based on a literature review. Among these ten genes, two may be involved in local adaptation based on our literature review. These two genes are polygalacturonase QRT3-like and NRT1/PTR_FAMILY 5.4-like. The gene polygalacturonase QRT3-like plays a role in pollen development in Arabidopsis thaliana L. and Brassica rapa L. We observed at the corresponding SNP markers, a correlation of the minor allele frequency and temperature-based environmental variables.

6
Long-read genome assemblies for the study of chromosome expansion: Drosophila kikkawai Drosophila takahashii, Drosophila bipectinata, and Drosophila ananassae

Leung, W.; Torosin, N.; Cao, W.; Reed, L. K.; Arrigo, C.; Elgin, S. C. R.; Ellison, C. E.

2023-05-24 genomics 10.1101/2023.05.22.541758 medRxiv
Top 0.1%
14.7%
Show abstract

Flow cytometry estimates of genome sizes among species of Drosophila show a 3-fold variation, ranging from [~]127 Mb in Drosophila mercatorum to [~]400 Mb in Drosophila cyrtoloma. However, the assembled portion of the Muller F Element (orthologous to the fourth chromosome in Drosophila melanogaster) shows a nearly 14-fold variation in size, ranging from [~]1.3 Mb to > 18 Mb. Here, we present chromosome-level long read genome assemblies for four Drosophila species with expanded F Elements ranging in size from 2.3 Mb to 20.5 Mb. Each Muller Element is present as a single scaffold in each assembly. These assemblies will enable new insights into the evolutionary causes and consequences of chromosome size expansion.

7
An improved ChEC-seq method accurately maps the genome-wide binding of transcription coactivators and sequence-specific transcription factors

Donczew, R.; Lalou, A.; Devys, D.; Tora, L.; Hahn, S.

2021-02-12 genomics 10.1101/2021.02.12.430999 medRxiv
Top 0.1%
14.4%
Show abstract

Mittal and colleagues have raised questions about mapping transcription factor locations on DNA using the MNase-based ChEC-seq method (Mittal et al., 2021). Partly due to this concern, we modified the experimental conditions of the MNase cleavage step and subsequent computational analyses, resulting in more stringent conditions for mapping protein-DNA interactions (Donczew et al., 2020). The revised method (dx.doi.org/10.17504/protocols.io.bizgkf3w) answers questions raised by Mittal et al. and, without changing earlier conclusions, identified widespread promoter binding of the transcription coactivators TFIID and SAGA at active genes. The revised method is also suitable for accurately mapping the genome-wide locations of DNA sequence-specific transcription factors.

8
Chromosome-scale scaffolding of the fungus gnat genome (Diptera: Bradysia coprophila)

Urban, J. M.; Gerbi, S. A.; Spradling, A. C.

2022-11-04 genomics 10.1101/2022.11.03.515061 medRxiv
Top 0.1%
14.2%
Show abstract

BackgroundThe fungus gnat, Bradysia (Sciara) coprophila, has compelling chromosome biology. Paternal chromosomes are eliminated during spermatogenesis whereas both maternal X sister chromatids are retained. Embryos start with three copies of the X chromosome, but 1-2 copies are eliminated from somatic cells as part of sex determination, and one is eliminated in the germline to restore diploidy. These developmentally normal events present opportunities to study chromosome movements that are unusual in other systems. To support such studies, we previously generated a highly contiguous optical-map-scaffolded long-read assembly (Bcop_v1) of the male somatic genome. However, the scaffolds were not chromosome-scale, the majority of the assembly lacked chromosome assignments, and the order and orientation of the contigs along chromosomes remained unknown. FindingsMale pupae Hi-C data was used to correct, order, and orient the contigs from Bcop_v1 into chromosome-scale scaffolds, producing the updated assembly, Bcop_v2. Several orthogonal analyses allowed us to (i) identify the corresponding chromosome for each scaffold, (ii) orient them with respect to polytene maps, and (iii) determine that they were highly concordant with the chromosomes they represent. Gene annotations produced for Bcop_v1 were lifted over to Bcop_v2. Chromosomal repeat distributions highlight a potential telomeric sequence. Finally, the Hi-C data shed new light on three "fold-back regions" seen to physically interact in images of polytene X chromosomes. ConclusionsStudies of the unusual chromosome movements in Bradysia coprophila will benefit from the updated assembly (Bcop_v2) where each somatic chromosome is represented by a single scaffold.

9
Genome and Genetic Engineering of the House Cricket (Acheta domesticus) Applications for Sustainable Agriculture

Dossey, A. T.; Oppert, B.; Chu, F.-C.; Lorenzen, M. D.; Scheffler, B.; Simpson, S.; Koren, S.; Johnston, J. S.; Kataoka, K.; Ide, K.

2022-12-16 genomics 10.1101/2022.12.14.520443 medRxiv
Top 0.1%
14.2%
Show abstract

The house cricket, Acheta domesticus, is one of the most farmed insects worldwide and the foundation of an emerging industry for the use of insects as a sustainable food source. Edible insects present a promising alternative for protein production amid a plethora of recent reports on climate change and biodiversity loss largely driven by agriculture. As with other agricultural crops, genetic resources are needed to improve crickets for food and other applications. We present the first high quality annotated genome assembly of A. domesticus which was assembled from long read data and scaffolded to chromosome level from long range data, providing information on promoters and genes needed for genetic manipulation. Gene groups that may be useful for improving the value of these insects to farmers were manually annotated, mainly genes related to immunity. Metagenome scaffolds in the A. domesticus assembly, including those from bacteria, other microbes and viruses such as Invertebrate Iridescent Virus 6 (IIV6), were submitted in a separate accession as host-associated sequences. We demonstrate both CRISPR/Cas9-mediated knock-in and knock-out of selected genes and discuss implications for the food, pharmaceutical and other industries. RNAi was demonstrated to disrupt the function of the vermilion eye-color gene to produce a useful white-eye biomarker phenotype. We are utilizing these data to develop base technologies and methodologies for downstream commercial applications, including the generation of more nutritious and disease resistant crickets as well as lines producing valuable bioproducts such as vaccines and antibiotics. We also discuss how this foundational research can play a critical role in utilizing the largest, most diverse yet almost entirely untapped biological resource on Earth: Class Insecta. Significance StatementSequencing and assembly of the genome of the house cricket has led to improvements in farmed insects for food, pharmaceutical and other applications.

10
Novel and improved Caenorhabditis briggsae gene models generated by community curation

Moya, N. D.; Stevens, L.; Miller, I. R.; Galindo, J. L.; Bardas, A. D.; Yeo, C.; Rozenich, A. J.; Xu, M.; Koh, E. S. H.; Andersen, E. C.

2023-05-18 genomics 10.1101/2023.05.16.541014 medRxiv
Top 0.1%
14.1%
Show abstract

BackgroundThe nematode Caenorhabditis briggsae has been used as a model for genomics studies compared to Caenorhabditis elegans because of its striking morphological and behavioral similarities. These studies yielded numerous findings that have expanded our understanding of nematode development and evolution. However, the potential of C. briggsae to study nematode biology is limited by the quality of its genome resources. The reference genome and gene models for the C. briggsae laboratory strain AF16 have not been developed to the same extent as C. elegans. The recent publication of a new chromosome-level reference genome for QX1410, a C. briggsae wild strain closely related to AF16, has provided the first step to bridge the gap between C. elegans and C. briggsae genome resources. Currently, the QX1410 gene models consist of protein-coding gene predictions generated from short- and long-read transcriptomic data. Because of the limitations of gene prediction software, the existing gene models for QX1410 contain numerous errors in their structure and coding sequences. In this study, a team of researchers manually inspected over 21,000 software-derived gene models and underlying transcriptomic data to improve the protein-coding gene models of the C. briggsae QX1410 genome. ResultsWe designed a detailed workflow to train a team of nine students to manually curate genes using RNA read alignments and predicted gene models. We manually inspected the gene models using the genome annotation editor, Apollo, and proposed corrections to the coding sequences of over 8,000 genes. Additionally, we modeled thousands of putative isoforms and untranslated regions. We exploited the conservation of protein sequence length between C. briggsae and C. elegans to quantify the improvement in protein-coding gene model quality before and after curation. Manual curation led to a substantial improvement in the protein sequence length accuracy of QX1410 genes. We also compared the curated QX1410 gene models against the existing AF16 gene models. The manual curation efforts yielded QX1410 gene models that are similar in quality to the extensively curated AF16 gene models in terms of protein-length accuracy and biological completeness scores. Collinear alignment analysis between the QX1410 and AF16 genomes revealed over 1,800 genes affected by spurious duplications and inversions in the AF16 genome that are now resolved in the QX1410 genome. ConclusionsCommunity-based, manual curation using transcriptome data is an effective approach to improve the quality of software-derived protein-coding genes. Comparative genomic analysis using a related species with high-quality reference genome(s) and gene models can be used to quantify improvements in gene model quality in a newly sequenced genome. The detailed protocols provided in this work can be useful for future large-scale manual curation projects in other species. The chromosome-level reference genome for the C. briggsae strain QX1410 far surpasses the quality of the genome of the laboratory strain AF16, and our manual curation efforts have brought the QX1410 gene models to a comparable level of quality to the previous reference, AF16. The improved genome resources for C. briggsae provide reliable tools for the study of Caenorhabditis biology and other related nematodes.

11
The disconnect2 mutation disrupts the tjp1b gene that is required for electrical synapse formation

Michel, J. C.; Lasseigne, A. M.; Marsh, A. J.; Miller, A. C.

2022-06-29 genetics 10.1101/2022.06.27.497839 medRxiv
Top 0.1%
14.0%
Show abstract

To investigate electrical synapse formation in vivo we used forward genetics to disrupt genes affecting Mauthner cell electrical synapses in larval zebrafish. We identify the disconnect2 (dis2) mutation for its failure to localize neural gap junction channels at electrical synapses. We mapped this mutation to chromosome 25 and identified a splice-altering mutation in the tjp1b gene. We demonstrated that the dis2 mutation disrupts tjp1b function using complementation analysis with CRISPR generated mutants. We conclude that the dis2 mutation disrupts the tjp1b gene that is required for electrical synapse formation. DescriptionNeural networks are circuits of neurons wired together during development that provide an animal with specialized behavioral outputs. Dedicated adhesions called neuronal synapses create sites of communication between neurons and can be categorized as either electrical or chemical. This work identifies a new mutation in tjp1b that is shown to be required for electrical synapse formation.

12
Progress Towards Plant Community Transcriptomics: Pilot RNA-Seq Data from 24 Species of Vascular Plants at Harvard Forest

Marx, H. E.; Jorgensen, S. A.; Wisely, E.; Li, Z.; Dlugosch, K. M.; Barker, M. S.

2020-04-01 genomics 10.1101/2020.03.31.018945 medRxiv
Top 0.1%
12.8%
Show abstract

O_LIPremise of the study: Large scale projects such as NEON are collecting ecological data on entire biomes to track and understand plant responses to climate change. NEON provides an opportunity for researchers to launch community transcriptomic projects that ask integrative questions in ecology and evolution. We conducted a pilot study to investigate the challenges of collecting RNA-seq data from phylogenetically diverse NEON plant communities, including species with diploid and polyploid genomes. C_LIO_LIMethods: We used Illumina NextSeq to generate >20 Gb of RNA-seq for each of 24 vascular plant species representing 12 genera and 9 families at the Harvard Forest NEON site. Each species was sampled twice, in July and August 2016. We used Transrate, BUSCO, and GO analyses to assess transcriptome quality and content. C_LIO_LIResults: We obtained nearly 650 Gb of RNA-seq data that assembled into more than 755,000 translated protein sequences across the 24 species. We observed only modest differences in assembly quality scores across a range of k-mer values. On average, transcriptomes contained hits to >70% of loci in the BUSCO database. We found no significant difference in the number of assembled and annotated genes between diploid and polyploid transcriptomes. C_LIO_LIDiscussion: Our resource provides new RNA-seq datasets for 24 species of vascular plants in Harvard Forest. Challenges associated with this type of study included recovery of high quality RNA from diverse species and access to NEON sites for genomic sampling. Overcoming these challenges offers clear opportunities for large scale studies at the intersection of ecology and genomics. C_LI

13
Optimizing CRISPR/Cas9 mutagenesis in Drosophila da neurons to avoid cytotoxicity

Duarte, V. N.; Najafi, V.; Thompson-Peer, K. L.

2025-10-04 genetics 10.1101/2025.10.03.680356 medRxiv
Top 0.1%
12.6%
Show abstract

Genetic perturbations are one of the great strengths of the model organism Drosophila melanogaster, with approaches such as classical mutagenesis and RNA interference enabling a wealth of biological discoveries. A more recent approach for altering gene expression is CRISPR/Cas9-based mutagenesis, but as with any new tool, its use must be optimized. High expression of Cas9 has been shown to cause cytotoxicity in some cell types. Here, we show that Cas9 expression alone causes cytotoxicity in the dendritic arborization (da) neurons that are widely used to study neuronal development and regeneration. We then systematically evaluate alternative Cas9 transgenes designed to lower total Cas9 expression, called uCas9 transgenes. We show that expression of these uCas9 transgenes results in little to no cytotoxicity to da neurons. Lastly, we demonstrate the ability of uCas9 transgenes to effectively and specifically gene edit in da neuro ns. Thus, we expand the toolkit of genetic perturbations available to researchers working with Drosophila da neurons or other cell types suceptible to cytotoxicity due to high expression of Cas9.

14
CRISPR-Cas9 repair complexity in Drosophila melanogaster: NHEJ-induced deletions and HDR variability in the bantam microRNA gene

De Sousa, P.; Houbron, E.; Seitz, H.; Busseau, I.

2025-05-30 genetics 10.1101/2025.05.29.656857 medRxiv
Top 0.1%
12.5%
Show abstract

CRISPR-Cas9-mediated gene editing was used to generate specific mutants of the bantam gene in Drosophila melanogaster. To drive non-homologous end joining (NHEJ) and achieve a precise deletion of most of the bantam locus, two guide RNAs targeting sites 90 base pairs apart were expressed in the germline using the UAS/GAL4 system. Thirty lethal and eight viable lines were established and analyzed. One lethal line exhibited the expected 90 bp deletion, while the others carried diverse indels at one or both cleavage sites. Among the viable lines, seven harbored a single-nucleotide deletion that did not disrupt bantam function. Notably, one viable line, band1-44, carried a hypomorphic allele that reduced organismal size without affecting viability. To generate precisely edited bantam variants, CRISPR-Cas9-mediated homology-directed repair (HDR) was used using donor plasmids containing engineered mutations in the miRNA seed region, along with a scarless dsRED fluorescent marker. Approximately 40% of the resulting fluorescent lines were correctly edited, demonstrating the efficiency of this strategy for producing specific bantam variants. The remaining lines exhibited unexpected outcomes, among which partial HDR events, where the dsRED marker was integrated but not the bantam mutations, and full donor plasmid integrations, which led to duplication of the bantam locus. These findings reveal the complexity of CRISPR-Cas9 outcomes, emphasizing the need for thorough screening and characterization of individual candidates in gene-editing experiments. They also provide valuable insights for optimizing genome editing strategies. Article summaryThe authors used Clustered Regularly Interspaced Short Palindromic Repeats (CRISPR) gene editing to mutate the bantam microRNA in Drosophila melanogaster. They guided Non-Homologous End Joining to induce a precise 90 bp deletion. It was the rarest occurrence, while small inactivating indels occurred frequently. They employed Homology-Directed Repair using a fluorescent marker to specifically target the bantam seed region. This efficiently produced the intended mutations but also led to unexpected outcomes, including partial sequence replacements and full donor plasmid integrations. These results reveal the complexity of gene editing outcomes and highlight the importance of thorough molecular characterization in genome engineering experiments.

15
Matters Arising: Re-examining the correlations between codon usage and dihedral bond angles using a population genetics model

Akeju, O.; Cope, A. L.

2022-12-04 evolutionary biology 10.1101/2022.10.26.513858 medRxiv
Top 0.1%
12.4%
Show abstract

A recent paper by Rosenberg et al. [2022] found a surprising correlation between synonymous codon usage and the dihedral bond angles of the resulting amino acid. However, their analysis did not account for the strongest known correlate of codon usage: gene expression. We applied the approach of Rosenberg et al. [2022] to simulated protein-coding sequences that (1) maintain the general relationship between codon usage and gene expression and (2) completely random codon usage. The analysis of the simulated data assuming a general relationship between gene expression and codon usage returned results remarkably similar to the real data. More concerning was the large number of significant results detected when sequences with random codon usage were analyzed. We believe that the specific results of Rosenberg et al. [2022] were confounded by the relationship between codon usage and gene expression, but also that their method is generally prone to detecting noise in protein bond angle distributions.

16
The reference genome of an endangered Asteraceae, Deinandra increscens subsp. villosa, endemic to the Central Coast of California

McEvoy, S. L.; Meyer, R. S.; Hasenstab-Lehman, K. E.; Guilliams, C. M.

2024-02-26 genomics 10.1101/2024.02.25.582000 medRxiv
Top 0.1%
12.2%
Show abstract

We present a high-quality reference genome of the federally endangered Gaviota tarplant, Deinandra increscens subsp. villosa (Madiinae, Asteraceae), an annual herb endemic to the Central California coast. Stewards of remaining populations have planned to apply conservation strategies informed by whole genome approaches. Generating PacBio Hifi, Oxford Nanopore Technologies, and Dovetail Omni-C data, we assembled a genome of 1.67 Gbp as 28.7 K scaffolds with a scaffold N50 of 74.9 Mb. BUSCO completeness for the final assembly was 98.1% with 15.7% duplicate copies. We annotated repeat content in 74.8% of the genome. Long terminal repeats (LTR) covered 44.0% of the genome with Copia families predominant at 22.9% followed by Gypsy at 14.2%. Both Gypsy and Copia elements were common in ancestral peaks of LTR, and the most abundant element was a Gypsy element containing nested Copia/Angela sequenced similarity, reflecting a complex evolutionary history of repeat activity. Gene annotation produced 41,039 genes and 69,563 transcripts, of which >99% were functionally annotated. BUSCO duplication rates remained very high with proteins at 50.4% complete duplicates and 46.0% single copy. Whole genome duplication (WGD) synonymous mutation rates of Gaviota tarplant and sunflower (Helianthus annuus) shared peaks that correspond to the last Asteraceae polyploidization event and subsequent divergence from a common ancestor at [~]27 mya. Tandem genes were twice as prevalent as WGD genes suggesting tandem genes could be an important strategy of environmental adaptation in this species. Article SummaryWe introduce a high-quality reference genome for the endangered Gaviota tarplant. The assembly is 1.67 Gbp with 98.1% BUSCO completeness and 41 K annotated genes. We find extensive Copia long terminal repeat sequences and tandem genes that suggest environmental adaptation strategies. Comparisons with sunflower suggest a shared polyploidization event around 27 million years ago, close to the date of the common ancestor divergence. This work underlines the importance of genomic studies in accurately understanding adaptations and conservation needs.

17
A cullin-RING ubiquitin ligase promotes thermotolerance as part of the Intracellular Pathogen Response in C. elegans

Panek, J.; Gang, S. S.; Reddy, K. C.; Luallen, R. J.; Fulzele, A.; Bennett, E. J.; Troemel, E. R.

2019-10-22 genetics 10.1101/586834 medRxiv
Top 0.1%
10.5%
Show abstract

Intracellular pathogen infection leads to proteotoxic stress in host organisms. Previously we described a physiological program in the nematode C. elegans called the Intracellular Pathogen Response (IPR), which promotes resistance to proteotoxic stress and appears to be distinct from canonical proteostasis pathways. The IPR is controlled by PALS-22 and PALS-25, proteins of unknown biochemical function, which regulate expression of genes induced by natural intracellular pathogens. We previously showed that PALS-22 and PALS-25 regulate the mRNA expression of the predicted ubiquitin ligase component cullin cul-6, which promotes thermotolerance in pals-22 mutants. However, it was unclear whether CUL-6 acted alone, or together with other ubiquitin ligase components. Here we use co-immunoprecipitation studies paired with genetic analysis to define the cullin-RING ligase components that act together with CUL-6 to promote thermotolerance. First, we identify a previously uncharacterized RING domain protein in the TRIM family we named RCS-1, which acts as a core component with CUL-6 to promote thermotolerance. Next, we show that the Skp-related proteins SKR-3, SKR-4 and SKR-5 act redundantly to promote thermotolerance with CUL-6. Finally, we screened F-box proteins that co-immunoprecipitate with CUL-6 and find that FBXA-158 promotes thermotolerance. In summary, we have defined the three core components and an F-box adaptor of a cullin-RING ligase complex that promotes thermotolerance as part of the IPR in C. elegans, which adds to our understanding of how organisms cope with proteotoxic stress.\n\nSignificance StatementIntracellular pathogen infection in the nematode Caenorhabditis elegans induces a robust transcriptional response as the host copes with infection. This response program includes several ubiquitin ligase components that are predicted to function in protein quality control. In this study, we show that these infection-induced ubiquitin ligase components form a protein complex that promotes increased tolerance of acute heat stress, an indicator of improved protein homeostasis capacity. These findings show that maintaining protein homeostasis may be a critical component of a multifaceted approach allowing the host to deal with stress caused by intracellular infection.

18
A method to estimate the frequency of chromosomal rearrangements induced by CRISPR/Cas9 multiplexing in Drosophila.

Ng, W. A.; Reed, B. H.

2019-10-22 genetics 10.1101/815431 medRxiv
Top 0.1%
10.5%
Show abstract

Using CRISPR/Cas9 to simultaneously induce mutations in two or more target genes, commonly referred to as multiplexing, may result in chromosomal rearrangements such as inversions or translocations. While this may be undesirable in some contexts, the ability to recover chromosomal rearrangements targeted to specific sites in the genome is potentially a powerful tool. Before developing such tools, however, it is first important to measure the frequency with which chromosome rearrangements are induced by CRISPR/Cas9 multiplexing. To do this, we have developed a self-selecting screening system using a Drosophila line that carries an autosomal pericentric inversion in what is known as the autosynaptic form. All progeny of normal females crossed to males of this autosynaptic stock are lethal due to excessive aneuploidy. If an inversion is induced within the female germline, and if it is analogous to the inversion in the male autosynaptic line, then it is possible to recover progeny in which aneuploidy is reduced and viability is restored. Using this self-selection method, we screened 130 females and recovered one new autosynaptic element. Salivary gland polytene chromosome analysis, PCR, and sequencing confirmed the recovery of a breakpoint induced precisely between the two sgRNA target sites. Overall, we demonstrate that CRISPR/Cas9 multiplexing can induce chromosomal rearrangements in Drosophila. Also, in using this particular system, the recovery of chromosomal rearrangements was not a high frequency event.

19
Gene model for the ortholog of tgo in Drosophila miranda

Jones, G. M.; Giunta, A. A.; Mastandrea, H.; Wittke-Thompson, J. K.

2025-08-06 genomics 10.1101/2025.08.04.668519 medRxiv
Top 0.1%
10.2%
Show abstract

Gene model for the ortholog of tango (tgo) in the April 2013 (UC Berkeley DroMir_2.2/DmirGB2) Genome Assembly (GenBank Accession: GCA_000269505.2) of Drosophila miranda. This ortholog was characterized as part of a developing dataset to study the evolution of the Insulin/insulin-like growth factor signaling pathway (IIS) across the genus Drosophila using the Genomics Education Partnership gene annotation protocol for Course-based Undergraduate Research Experiences.

20
A Drosophila screen identifies a role for histone methylation in ER stress preconditioning

Owings, K. G.; Chow, C. Y.

2023-03-11 genetics 10.1101/2023.03.10.532109 medRxiv
Top 0.1%
10.0%
Show abstract

Stress preconditioning occurs when transient, sublethal stress events impact an organisms ability to counter future stresses. Although preconditioning effects are often noted in the literature, very little is known about the underlying mechanisms. To model preconditioning, we exposed a panel of genetically diverse Drosophila melanogaster to a sublethal heat shock and measured how well the flies survived subsequent exposure to endoplasmic reticulum (ER) stress. The impact of preconditioning varied with genetic background, ranging from dying half as fast to four and a half times faster with preconditioning compared to no preconditioning. Subsequent association and transcriptional analyses revealed that histone methylation, transcriptional regulation, and immune status are all candidate preconditioning modifier pathways. Strikingly, almost all subunits (7/8) in the Set1/COMPASS complex were identified as candidate modifiers of preconditioning. Functional analysis of Set1 knockdown flies demonstrated that loss of Set1 led to the transcriptional dysregulation of canonical ER stress genes during preconditioning. Based on these analyses, we propose a model of preconditioning in which Set1 helps to establish an interim transcriptional memory of previous stress events, resulting in a preconditioned response to subsequent stress. Author SummaryStress preconditioning occurs when a history of previous stresses impacts an organisms response to subsequent stresses. There are many documented cases of stress preconditioning, but the specific genes and pathways involved in the process are not well understood. Here, we take advantage of the natural genetic variation in the Drosophila Genetic Reference Panel to examine the role genetic variants play in modifying preconditioning outcomes. Our goal is to identify genes that contribute to the underlying mechanisms of preconditioning. Specifically, we measured preconditioning outcomes as the change in death rates of Drosophila on constant endoplasmic reticulum (ER) stress with and without heat stress preconditioning for each strain. We demonstrate that preconditioning outcomes are highly dependent on genetic background. Through association and transcriptional analyses, we found that histone methylation, transcriptional regulation, and immune status are all candidate pathways impacting preconditioning. Functional studies utilizing Set1 knockdown flies demonstrated that Set1, a histone H3 lysine 4 (H3K4) methyltransferase enzyme, is critical for the proper expression of a subset of ER stress genes during preconditioning. Our data indicate that Set1 likely aids in creating a transient transcriptional memory following initial stress that impacts the response to subsequent stress.